Open Relation Extraction for Polish: Preliminary Experiments

نویسنده

  • Jakub Piskorski
چکیده

This paper presents preliminary experiments on Open Relation Extraction for Polish. In particular, a variant of a priorart algorithm for open relation extraction for English has been adapted and tested on a set of articles from Polish on-line news. The paper provides initial evaluation results, which constitute the point of departure for in-depth research in this area.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lsa Based Extraction of Semantic Similarity for Polish

A recently started project on building the Polish WordNet is the background to the presented work. Here, an initial approach to automatic extraction of semantic similarity relation between Polish lexemes is discussed. The approach follows directly the main lines of Latent Semantic Analysis, but it is modified in order to cope with the rich inflection of Polish and the lack of a corpus comparabl...

متن کامل

Dependency-based Extraction of Entity-relationship Triples from Polish Open-domain Texts

We present a prototype system for extracting arbitrary relations between named entities from open-domain texts in Polish based on DEBORA – a dependency-based approach to the problem. The presented method is designed for the purpose of the conducted experiment and is adapted to morpho-syntactic properties of Polish, e.g. free word order, high degree of morphological marking. Our preliminary resu...

متن کامل

DEBORA: Dependency-Based Method for Extracting Entity-Relationship Triples from Open-Domain Texts in Polish

We present DEBORA – a dependency-based approach to the problem of extraction of arbitrary relations between named entities from open-domain texts in Polish. The presented method is designed for the purpose of the conducted experiment and is adapted to morpho-syntactic properties of Polish, e.g. free word order, high degree of morphological marking. Our preliminary results show that the method i...

متن کامل

Polish-English Word Alignment: Preliminary Study

As word alignment is an important topic in statistical machine translation domain, bilingual dictionary extraction or linguistic information projection studies, a lot of attention has been dedicated to improve its quality. However not all languages are sufficiently represented in these examinations. In the following, we give a description of experiments with the Polish-English word alignment tr...

متن کامل

A New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model

Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015